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Abstract 

We propose a modified time lag random matrix theory in order to study time lag cross- 
correlations in multiple time series. We apply the method to 48 world indices, one for each of 
48 different countries. We find long-range power-law cross-correlations in the absolute values of re- 
turns that quantify risk, and find that they decay much more slowly than cross-correlations between 
the returns. The magnitude of the cross-correlations constitute "bad news" for international invest- 
ment managers who may believe that risk is reduced by diversifying across countries. We find that 
when a market shock is transmitted around the world, the risk decays very slowly. We explain these 
time lag cross-correlations by introducing a global factor model (GFM) in which all index returns 
fluctuate in response to a single global factor. For each pair of individual time series of returns, 
the cross-correlations between returns (or magnitudes) can be modeled with the auto-correlations 
of the global factor returns (or magnitudes). We estimate the global factor using principal compo- 
nent analysis, which minimizes the variance of the residuals after removing the global trend. Using 
random matrix theory, a significant fraction of the world index cross-correlations can be explained 
by the global factor, which supports the utility of the GFM. We demonstrate applications of the 
GFM in forecasting risks at the world level, and in finding uncorrelated individual indices. We find 
10 indices are practically uncorrelated with the global factor and with the remainder of the world 
indices, which is relevant information for world managers in reducing their portfolio risk. Finally, 
we argue that this general method can be applied to a wide range of phenomena in which time 
series are measured, ranging from seismology and physiology to atmospheric geophysics. 

PACS numbers: PACS numbers:89.65.Gh, 89.20.-a, 02.50.Ey 
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I. INTRODUCTION 



When complex systems join to form even more complex systems, the interaction of 



the constituent subsystems is highly random 



^MiM- The complex stochastic interactions 



among these subsystems are commonly quantified by calculating the cross-correlations. This 
method has been applied in systems ranging from nanodevices |5|-l7|, atmospheric geophysics 
8|, and seismology o]-!!], to finance 12-23|. Here we propose a method of estimating the 
most significant component in explaining long-range cross-correlations. 

Studying cross-correlations in these diverse physical systems provides insight into the 
dynamics of natural systems and enables us to base our prediction of future outcomes on 

i estimate on cross-correlation matrices 



In seismology, cross-correlation lev- 



current information. In finance, we base our risl 
derived from asset and investment portfolios [is], 
els are used to predict earthquake probability and intensity [10|. In nanodevices used in 
quantum information processing, electronic entanglement necessitates the computation of 
noise cross-correlations in order to determine whether the sign of the signal will be re- 
versed when compared to standard devices 5|. Reference 25|] reports that cross-correlations 
for At = calculated between pairs of EEG time series are inversely related to dissociative 
symptoms (psychometric measures) in 58 patients with paranoid schizophrenia. In genomics 



data, Ref. 



26| reports spatial cross-correlations corresponding to a chromosomal distance of 



^ 10 million base pairs. In physiology, Ref. 26|] reports a statistically significant difference 
between alcoholic and control subjects. 

Many methods have been used to investigate cross-correlations (i) between pairs of si- 



multaneously recorded time series 
recorded time series 



21 



15 



27 



22j or 



Reference 



ii) among a large number of simultaneously- 
uses a power mapping of the elements in the 



correlation matrix that suppresses noise. Reference 2l| proposes detrended cross-correlation 
analysis (DCCA), which is an extension of detrended fluctuation analysis (DFA) 29| and is 
based on detrended covariance. Reference j^] proposes a method for estimating the cross- 
correlation function Cxy of long-range correlated series Xt and yt. For fractional Brownian 
motions with Hurst exponents Hi and H2, the asymptotic expression for C^y scales as a 
power of n with exponents Hi and H2. 

Univariate (single) financial time series modeling has long been a popular technique in 
science. To model the auto-correlation of univariate time series, traditional time series 
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models such as autoregressive moving average (ARMA) models have been proposed 30 |. 
The ARMA model assumes variances are constant with time. However, empirical stud- 
ies accomplished on financial time series commonly show that variances change with time. 
To model time- varying variance, the autoregressive conditional heteroskedasticity (ARCH) 
model was proposed 3l|]- Since then, many extensions of ARCH has been proposed, includ- 
ing the generalized autoregressive conditional heteroskedasticity (GARCH) model (32| and 
;he fractionally- integrated autoregressive conditional heteroskedasticity (FIARCH) model 



33|. In these models, long-range auto-correlations in magnitudes exist, so a large price 



change at one observation is expected to be followed by a large price change at the next 
observation. Long-range auto-correlations in magnitude of signals have been reported in 



finance 



33| . physiology |3J, |35|, river fiow data |36|, and weather data |37 |. 



Besides univariate time series models, modeling correlations in multiple time series has 
been an important objective because of its practical importance in finance, especially in port- 



folio selection and risk management 



38 



39| . In order to capture potential cross-correlations 



among different time series, models for coupled heteroskedastic time series have been in- 



troduced 



40 



42]. However, in practice, when those models are employed, the number of 



parameters to be estimated can be quite large. 

A number of researchers have applied multiple time series analysis to world indices. 



12j reported that for 



mainly in order to analyze zero time-lag cross-correlations. Reference 
international stock return of nine highly-developed economies, the cross-correlations between 
each pair of stock returns fiuctuate strongly with time, and increase in periods of high market 
volatility. By volatility we mean time-dependent standard deviation of return. The finding 
that there is a link between zero time lag cross-correlations and market volatility is "bad 
news" for global money managers who typically reduce their risk by diversifying stocks 
throughout the world. In order to determine whether financial crises are short-lived or 
long-lived, Ref. 43| recently reported that, for six Latin American markets, the effects of a 
financial crisis are short-range. Between two and four months after each crisis, each Latin 
American market returns to a low-volatility regime. 

In order to determine whether financial crisis are short-term or long-term at the world 
level, we study 48 world indices, one for each of 48 different countries. We analyze cross- 
correlations among returns and magnitudes, for zero and non-zero time lags. We find that 
cross-correlations between magnitudes last substantially longer than between the returns. 
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similar to the properties of auto-correlations in stock market returns 4J]. We propose a 
general method in order to extract the most important factors controllinsr cross-correlations 
,„ t,„,e senes. Based on ...do. ,„at.x tKeo,v and pr.ncpa. component analysis Q we 
propose how to estimate the global factor and the most significant principal components in 
explaining the cross-correlations. This new method has a potential to be broadly applied in 
diverse phenomena where time series are measured, ranging from seismology to atmospheric 
geophysics. 

This paper is organized as follows. In Section II we introduce the data analyzed, and the 
definition of return and magnitude of return. In Section III we introduce a new modified time 
lag random matrix theory (TLRMT) to show the time-lag cross-correlations between the 
returns and magnitudes of world indices. Empirical results show that the cross-correlations 
between magnitudes decays slower than that between returns. In Section IV we introduce a 
single global factor model to explain the short- or long-range correlations among returns or 
magnitudes. The model relates the time-lag cross-correlations among individual indices with 
the auto- correlation function of the global factor. In Section V we estimate the global factor 
by minimizing the variance of residuals using principal component analysis (PCA), and we 
show that the global factor does in fact account for a large percentage of the total variance 
using RMT. In Section VI we show the applications of the global factor model, including 
risk forecasting of world economy, and finding countries who have most the independent 
economies. 



II. DATA ANALYZED 



In order to estimate the level of relationship between individual stock markets — either 
long-range or short-range cross-correlations exist at the world level — we analyze = 48 
world-wide financial indices, Si^t, where i = 1,2, ...,48 denotes the financial index and 
t denotes the time. We analyze one index for each of 48 different countries: 25 European 

including Australia and New Zealand) 46| , 2 American indices 



maices 



45| . 15 Asian indices 



471], and 4 African indices 48|. In studying 48 economies that include both developed 
and developing markets we significantly extend previous studies in which only developed 
economies were included — -^e^., the seven economies analyzed in Refs. [l^ 49|, and the 17 



countries studied in Ref. 50|. We use daily stock- index data taken from Bloomberg, as 
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opposed to weekly js^ or monthly data [l2|. The data cover the period 4 Jan 1999 through 
10 July 2009, 2745 trading days. For each index Si^t, we define the relative index change 
(return) as 

Ri^t = log Si^t - log Si^t-i, (1) 

where t denotes the time, in the unit of one day. By magnitude of return we denote the 
absolute value of return after removing the mean 



\R 



(2) 



III. MODIFIED TIME-LAG RANDOM MATRIX THEORY 



A. Basic Ideas of Time-lag Random Matrix Theory 



In order to quantify the cross-correlations, random matrix theory (RMT) (see Refs. 51 1 



52] and references therein) was proposed in order to analyze collective phenomena in nu- 



clear physics. Refs. |15i] extended RMT to cross-correlation matrices in order to find cross- 
correlations in collective behavior of financial time series. The largest eigenvalue A+ and 
smallest eigenvalue A_ of the Wishart matrix W (a correlation matrix of uncorrelated time 
series with finite length) are 

where Q = T/N{> 1), and is the matrix dimension and T the length of each time series. 
The larger the discrepancy between (a) the correlation matrix C between empirical time series 
and (b) the Wishart matrix W obtained between uncorrelated time series, the stronger are 
the cross-correlations in empirical data 15|. Many RMT studies reported equal-time (zero 
At) cross-correlations between different empirical time series 

Recently time-lag generalizations of RMT have been proposed 
generalizations of RMT, based on the eigenvalue spectrum called time-lag RMT (TLRMT), 
Ref. [26| found long-range cross-correlations in time series of price fluctuations in absolute 
values of 1340 members of the New York Stock Exchange Composite, in both healthy and 
pathological physiological time series, and in the mouse genome. 

We compute for varying time lags At the largest singular values A/, (At) of the cross- 



53^561 
57 



59| . In one of the 
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correlation matrix of N- variable time series X 



aiaj 



We also compute XilAt) of a similar matrix C{At), where Xi^t are replaced by the mag- 
nitudes \Xi^t\- The squares of the non-zero singular values of C are equal to the non-zero 
eigenvalues of CC"*" or where by C+ we denote the transpose of C. In a singular value 



decomposition (SVD) [26|, |59|, |60| C = UDV^ the diagonal elements of D are equal to singu- 
lar values of C, where the U and V correspond to the left and right singular vectors of the 
corresponding singular values. We apply SVD to the correlation matrix for each time lag 
and calculate the singular values, and the dependence of the largest singular value XiiAt) 
on At serves to estimate the functional dependence of the collective behavior of Cij on At 



26| 



B. Modifications of Cross-Correlation Matrices 

We make two modifications of correlation matrices in order to better describe correlations 
for both zero and non-zero time lags. 

(i) The first modification is a correction for correlation between indices that are not 
frequently traded. Since different countries have different holidays, all indices contain 
a large number of zeros in their returns. These zeros lead us to underestimate the 
magnitude of the correlations. To correct for this problem, we define a modified cross- 
correlation between those time series with extraneous zeros, 

T' aiCj 
Here T' is the time period during which both Xj ^ and Xj t+At are non-zero. With this 
definition, the time periods during which Xj ^ or Xj^t+At exhibit zero values have been 
removed from the calculation of cross-correlations. The relationship between C-j(At) 
and Cjj(At) is 

C^(At) = ^Q,{At). (6) 

(ii) The second modification corrects for auto-correlations. The main diagonal elements in 
the correlation matrix are ones for zero-lag correlation matrices and auto-correlations 



for non-zero lag correlation matrices. Thus, time-lag correlation matrices allow us to 
study both auto-correlations and time-lag cross-correlations. If we study the decay of 
the largest singular value, we see a long-range decay pattern if there are long-range 
auto-correlations for some indices but no cross-correlation between indices. To remove 
the influence of auto-correlations and isolate time-lag cross-correlations, we replace 
the main diagonals by unity, 

{1 when i = j 

(7) 
q,.(At) when ^ ^ 

With this definition the influence of auto-correlations is removed, and the trace is kept 
the same as the zero time-lag correlation matrix. 

C. Empirical Results 

In Fig. 1(a) we show the distribution of cross-correlations between zero and non-zero lags. 
For At = the empirical pdf P(Cjj) of the cross-correlation coefficients C^- substantially 
deviates from the corresponding pdf P{Wij) of a Wishart matrix, implying the existence of 
equal-time cross-correlations. 

In order to determine whether short-range or long-range cross-correlations accurately 
characterize world financial markets, we next analyze cross-correlations for {At ^ 0). We 
find that with increasing At the form of P{Cij) quickly approaches the pdf P{Wij), which 



is normally distributed with zero mean and standard deviation l/y/N [61 1. 

In Fig. 1(b) we also show the distribution of cross-correlations between magnitudes. In fi- 
nancial data, returns Ri^t are generally uncorrelated or short-range auto-correlated, whereas 
the magnitudes are generally long-range auto-correlated [33|,l62|. We thus examine the cross- 
correlations Cij{At) between |rjj| for different At. In Fig. 1(b) we find that with increasing 
At, P{Cij) approaches the pdf of random matrix P{Wij) more slowly than P{Cij), imply- 
ing that cross-correlations between index magnitudes persist longer than cross-correlations 
between index returns. 

In order to demonstrate the decay of cross-correlations with time lags, we apply modified 
TLRMT. Fig. 2 shows that with increasing At the largest singular value calculated for 
C decays more slowly than the largest singular value calculated for C. This result implies 
that among world indices, the cross-correlations between magnitudes last longer than cross- 
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correlations between returns. In Fig. 2 we find that vs. At decays as a power law function 
with the scaling exponent equal to 0.25. The faster decay of A/, vs. At for C implies very 
weak (or zero) cross-correlations among world-index returns for larger At, which agrees with 
the empirical finding that world indices are often uncorrelated in returns. Our findings of 
long-range cross-correlations in magnitudes among the world indices is, besides a finding in 
Ref. [3], another piece of "bad news" for international investment managers. World market 
risk decays very slowly. Once the volatility (risk) is transmitted across the world, the risk 
lasts a long time. 



IV. GLOBAL FACTOR MODEL 



The arbitrage pricing theory states that asset returns follow a linear combination of 
various factors [633 . that the factor structure can also model time lag pairwise 

cross-correlations between the returns and between magnitudes. To simplify the structure, 
we model the time lag cross-correlations with the assumption that each individual index 
fluctuates in response to one common process, the "global factor" Mt, 

Ri,t = l^i + biMt + ei,t. (8) 

Here in the global factor model (GFM), /Xj is the average return for index i, Mt is the 
global factor, and ei^t is the linear regression residual, which is independent of Mf, with 
mean zero and standard deviation cij. Here bi indicates the covariance between Ri^t and 
Mf, CoyiRi t, Mf) = biVai{Mt). This single factor model is similar to the Sharpe market 



model 6^, but instead of using a known financial index as the global factor Mt, we use 
factor analysis to find Mt, which we introduce in the next section. We also choose Mf as 
a zero-mean process, so the expected return E{Ri t) = fit, and the global factor Mt is only 
related with market risk. We define a zero-mean process j as 

Ti^t = Ri,t - E{Ri^t) = biMt + ei,t. (9) 

A second assumption is that the global factor can account for most of the correlations. 
Therefore we can assume that there are no correlations between the residuals of each index, 
Cov(e i,t,'^j,t) = 0. Then the covariance between Ri^t and Rj^t is 

Cov(i?,,i, R^^t) = Cov(r,,t, r,,t) = hb,Yai{Mt). (10) 



The covariance between magnitudes of returns depends on the return distribution of Mt 
and Ri^t, but the covariance between squared magnitudes rf ^ indicates the properties of the 
magnitude cross- correlations. The covariance between rf^^ and is 

Covirl,r%) = bjb'^Va.r{M^). (11) 

The above results in Eqs. (fTOi) - (fTT!) show that the variance of the global factor and square 
of the global factor account for all the zero time lag covariance between returns and squared 
magnitudes. For time lag covariance between r^^t, we find 

Cov(ri,t,rj-t, At) = E{ri^t,rj^t~At) - E{ri^t)E{rj^t-At) (12) 
= b,b,AM{At). (13) 

Here 

AM{At) = E{MtMt-At) - E{Mt)E{Mt^At) (M) 
is the autocovariance of Mt. Similarly, we find 

Covirl,rl,At) = by^AM2{At). (15) 

Here 

A,j2iAt) = EiMfMl^t) - E{M^)E{Ml^t) (16) 

is the autocovariance of M^. 

In GFM, the time lag covariance between each pair of indices is proportional to the 
autocovariance of the global factor. For example, if there is short-range autocovariance for 
Mt and long-range autocovariance for Mf, then for individual indices the cross- covariance 
between returns will be short-range and the cross-covariance between magnitudes will be 
long-range. Therefore, the properties of time-lag cross-correlation in multiple time series 
can be modeled with a single time series — the global factor Mt. 

The relationship between time lag covariance among two index returns and autocovariance 
of the global factor also holds for the relationship between time lag cross-correlations among 
two index returns and auto-correlation function of the global factor, because it only need to 
normalize the original time series to mean zero and standard deviation one. 
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V. ESTIMATION AND ANALYSIS OF THE GLOBAL FACTOR 



A. Estimation of the Global factor 

In contrast to domestic markets, where for a given country we can choose the stock index 
as an estimator of the "global" factor, when we study world markets the global factor is 
unobservable. At the world level when we study cross-correlations among world markets, 



we estimate the global factor using principal component analysis (PCA) 27 1. 

In this section we use bold font for N dimensional vectors or NxN matrix, and underscore 
t for time series. Suppose Rt = {Ri,t, R2,t, ■ ■ ■ , RN,tY is the multiple time series, each row 
of which is an individual time series Ri^t = -Ri,2, • • • , Ri^r)- We standardize each time 
series to zero mean and standard deviation 1 as 

_ Rj^t - {Ri,t) 

The correlation matrix can be calculated as C = ^ztZt^ where zt^ is the transpose of 
Zt, and the T in the denominator is the length of each time series. Then we diagonalize the 
NxN correlation matrix C 

C = UAU^. (18) 

Here A = diag(Ai, A2, Xn) and Ai > A2 > ... > Xn are the eigenvalues in non- increasing 
order, U is an orthonormal matrix, whose i-th column is the basis eigenvector Ui of C, and 
is the transpose of U, which is equal to U^^ because of orthonormality. 
For each eigenvalue and the corresponding eigenvector, it holds 

Xi = Ui'^Cui = Ui'^Cov(zt)ui = Var(ui'^zt) = Var (ai^t)- (19) 

According to PCA, ai^t = Ui^zt is defined as the i-th principal component (a^.t), and the 
eigenvalue Aj = Var(zj 4) indicates the portion of total variance of Zt contributed to ttj^t, as 
shown in Eq. (fT9|) . Since the total variance of zt is 

N N 

J2 Var(z,,i) = trace(C) = ^ A„ (20) 

i=l i=l 

the expression Aj/trace(C) indicates the percentage of the total variance of zt that can be 
explained by the ai^f According to PCA (a) the principal components ai^t are uncorrelated 
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with each other and (b) ai^t maximizes the variance of the hnear combination of U^zt with 
the orthonormal restriction U^U = 1 given the previous principal components j27 |. 
From the orthonormal property of U we obtain 

I = UU^ = uiu^ + U2U^ + ... + u^u^, (21) 

where I is the identity matrix. Then the multiple time series Zt can be represented as a 
linear combination of all the cxt 

Zt = (Uiuf + U2U2 + ... + UArU^)Zt 

= uiai^t + U2a2,t + ••• + UNaN,t- (22) 

The total variance of all time series can be proved to be equal to the total variance of all 
principal components 

N 

^^Var(zjt) = Var(ui)a;i^f + ... + Var(u7v)aAf,t (23) 

i=l 

N N 

= ^ uf UiVar(ai,t) = ^Var(a,,t). (24) 

j=i 1=1 

Next we assume that Var(ai^i) = Ai is much larger than each of the rest of eigenvalues — 
which means that the first ck^, ai^t, accounts for most of the total variances of all the time 
series. We express zt as the sum of the first part of Eq. fl22|) corresponding to ai^t and the 
error term combined from all other terms in Eq. (122|) . Thus 

Zt = uiai^t + rit, 

N 

rit = ^Uitti^i. (25) 

1=2 

Then ai^t is a good approximation of the global factor Mj , because it is a linear combination 
of Ri^t that accounts for the largest amount of the variance, ai is a zero-mean process 
because it is a linear combination of Zi^t which are also zero-mean processes (see Eq. ([1] 
Comparing Eqs. f lT7|) and fl25l) with 



Ri,t = l^i + hMt + e,,t, (26) 

we find the following estimates: 

Mt = ai^t, 
hi = a{Ri)uii, 

ei,t = cT{Ri)r]i,t. (27) 
12 



Using Eq. (fTOjl we find that 

Con {Mt, Ri^t) = \/\iUii. 
In the rest of this work, we apply the method of Eq. flTTI) to empirical data. 



(28) 



B. Analysis of the global factor 

Next we apply the method of Eq. fl27|) to estimate the global factor of 48 world in- 
dex returns. We calculate the auto-correlations of Mj and \Mt\, which are shown in Figs. 
3 and 4. Precisely, for the world indices. Fig. 3(a) shows the time series of the global 
factor Mti and Fig. 3(b) shows the auto-correlations in Mj. We find only short-range auto- 
correlations because, after an interval At = 2, most auto-correlations in Mt fall in the 
range of (— 1.96a/1/T, 1.96^1/T) [61], which is the 95% confidence interval for zero auto- 
correlations. Here T = 2744. 

For the 48 world index returns. Fig. 4(a) shows the time series of magnitudes |M(|, 
with few clusters related to market shocks during which the market becomes fiuctuates 
more. Fig. 4(b) shows that, in contrast to Mt, the magnitudes \Mt\ exhibit long-range auto- 
correlations since the values \Mt\ are significant even after At = 100. The auto-correlation 
properties of the global factor are the same as the auto-correlation properties of the indi- 
vidual indices, i.e., there are short-range auto-correlations in Mt and long-range power-law 



auto-correlations in \Mt 



33 



62l |. These results are also in agreement with Fig. 1(b) where 
the largest singular value A^, vs. At calculated for C decays more slowly than the largest 
singular value calculated for C. As found in Ref. [26,] for At >> 1, AL(At) approximately 
follows the same decay pattern as cross-correlation functions. Although a Ljung-Box test 
shows that the return auto-correlation is significant for a 95% confidence level [Gsl , the return 
auto-correlation is only 0.132 for At = 1 and becomes insignificant after At = 2 . Therefore, 
for simplicity, we only consider magnitude cross-correlations in modeling the global factor. 
We model the long-range market-factor returns M with a particular version of the GARCH 



process, the GJR GARCH process 66|, because this GARCH version explains well the asym- 



metry in volatilities found in many world indices 
written as 



66|-|68j. The GJR GARCH model can be 



et = (TtVt, (29) 
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= «o + + lTt^^)eli + J2 (30) 

i=l i=l 

where at is the volatihty and rjt is a random process with a Gaussian distribution with 
standard deviation 1 and mean 0. The coefficients a and f3 are determined by a maximum 
hkehhood estimation (MLE) and Tj = 1 if ef_i < 0, = if ef_i > 0. We expect 
the parameter 7 to be positive, implying that "bad news" (negative increments) increases 
volatihty more than "good news" . For the sake of simplicity, we follow the usual procedure 
of setting p = g = 1 in all numerical simulations. In this case, the GJR-GARCH(1,1) model 
for the market factor can be written as 

Mt = atvu (31) 
= ao + (ai + lTt_^)el, + f3,al,. (32) 

We estimate the coefficients in the above equations using MLE, where the estimated coeffi- 
cients are shown in Table. 1. 

Next we test the hypothesis that a significant percentage of the world cross-correlations 
can be explained by the global factor. By using PGA we find that the global factor can 
account for 30.75% of the total variance. Note that, according to RMT, only the eigenval- 
ues larger than the largest eigenvalue of a Wishart matrix calculated by Eq. (and the 
corresponding cx.s) are significant. To calculate the percentage of variance the significant cx.s 



account for, we employ the RMT approach proposed in Ref. [15|. The largest eigenvalue for 
a Wishart matrix is A+ = 1.282 for = 48 and T = 2744 we found in the empirical data. 
From all the 48 eigenvalues, only the ffist three are significant: Ai = 14.762, A2 = 3.453, and 
A3 = 1.380. This result implies that among the significant factors, the global factor accounts 
for Ai/ J2i=i = 75.34% of the variance, conffiming our hypothesis that the global factor 
accounts for most variance of all individual index returns. 

PGA is defined to estimate the percentage of variance the global factor can account for 
zero time lag correlations. Next we study the time lag cross-correlations after removing the 
global trend, and apply the SVD to the correlation matrix of regression residuals rji of each 
index [see Eq. ([8])]. Our results show that for both returns and magnitudes, the remaining 
cross- correlations are very small for all time lags compared to cross-correlations obtained for 
the original time series. This result additionally conffims that a large fraction of the world 
cross- correlations for both returns and magnitudes can be explained by the global factor. 
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VI. APPLICATIONS OF GLOBAL FACTOR MODEL 



A. Locating and Forecasting global risks 

The asymptotic (unconditional) variance for the GJR-GARCH model is ao/(l — tti — /3i — 
7/2) = 10.190 [g^. For the market factor the conditional volatility at can be estimated 
by recursion using the historical conditional volatilities and fitted coefficients in Eq. (!32l) . 
For example, the largest cluster at the end of the graph shows the 2008 financial crisis. 
In Fig. 5(a) we show the time series of the conditional volatility of Eq. (132!) of the global 
factor. The clusters in the conditional volatilities may serve to predict market crashes. In 
each cluster, the height is a measure of the size of the market crash, and the width indicates 
its duration. In Fig. 5(b) we show the forecasting of the conditional volatility of the global 
factor, which asymptotically converges to the unconditional volatility. 

B. Finding uncorrelated individual indices 

Next, in Fig. |6]we show the cross-correlations between the global factor and each individ- 
ual index using Eq. f l28|) . There are indices for which cross-correlations with the global factor 
are very small compared to the other indices; 10 of 48 indices have cross- correlations coeffi- 
cients with the global factor smaller than 0.1. These indices correspond to Iceland, Malta, 
Nigeria, Kenya, Israel, Oman, Qatar, Pakistan, Sri Lanka, and Mongolia. The financial mar- 
ket of each of these countries is weakly bond with financial markets of other countries. This 
is useful information for investment managers because one can reduce the risk by investing 
in these countries during world market crashes which, seems, do not severely infiuence these 
countries. 

VII. DISCUSSION 

We have developed a modified time lag random matrix theory (TLRMT) in order to 
quantify the time-lag cross-correlations among multiple time series. Applying the mod- 
ified TLRMT to the daily data for 48 world-wide financial indices, we find short-range 
cross-correlations between the returns, and long-range cross-correlations between their mag- 
nitudes. The magnitude cross-correlations show a power law decay with time lag, and the 
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scaling exponent is 0.25. The result we obtain, that at the world level the cross-correlations 
between the magnitudes are long-range, is potentially significant because it implies that 
strong market crashes introduced at one place have an extended duration elsewhere — which 
is "bad news" for international investment managers who imagine that diversification across 
countries reduces risk. 

We model long-range world-index cross-correlations by introducing a global factor model 
in which the time lag cross-correlations between returns (magnitudes) can be explained by 
the auto-correlations of the returns (magnitudes) of the global factor. We estimate the 
global factor as the first component by using principal component analysis. Using random 
matrix theory, we find that only three principal components arc significant in explaining the 
cross-correlations. The global factor accounts for 30.75% of the total variance of all index 
returns, and 75.34% of the variance of the three significant principle components. Therefore, 
in most cases, a single global factor is sufficient. 

We also show the applications of the GFM, including locating and forecasting world risk, 
and finding individual indices that are weakly correlated to the world economy. Locating and 
forecasting world risk can be realized by fitting the global factor using a GJR-GARCH(1,1) 
model, which explains both the volatility correlations and the asymmetry in the volatility 
response to both "good news" and "bad news." The conditional volatilities calculated after 
fitting the GJR-GARCH(1,1) model indicates the global risk, and the risk can be forecasted 
by recursion using the historical conditional volatihties and the fitted coefficients. To find 
the indices that are weakly correlated to the world economy, we calculate the correlation 
between the global factor and each individual index. We find 10 indices which have a 
correlation smaller than 0.1, while most indices are strongly correlated to the global factor 
with the correlations larger than 0.3. To reduce risk, investment managers can increase 
the proportion of investment in these countries during world market crashes, which do not 
severely infiuence these countries. 

Based on principal component analysis, we propose a general method which helps extract 
the most significant components in explaining long-range cross-correlations. This makes the 
method suitable for broad range of phenomena where time series are measured, ranging from 
seismology and physiology to atmospheric geophysics. We expect that the cross-correlations 
in EEG signals are dominated by the small number of most significant components control- 
hng the cross-correlations. We speculate that cross-correlations in earthquake data are also 
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controlled by some major components. Thus the method may have significant predictive 
and diagnostic power that could prove useful in a wide range of scientific fields. 

We thank Ivo Grosse and T. Preis for valuable discussions and NSF for financial support. 
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TABLE I: GJR-G ARCH (1,1) coefficients of the global factor. The P-values and t-values comfirms 
that all these parameters are significant at 95% confidence level. The positive value of 7 means "bad 
news" has larger impact on the global market than "good news". We find ai +7/2 = 0.9756, 
which is very close to 1, and so indicate long-range volatility auto-correlations. 





Value 


Std. Error t- value P- value 




0.2486 


0.0283 


8.789 0.0000 


ai 


0.0170 


0.0080 


2.128 0.0334 


Pi 


0.8790 


0.0101 


86.939 0.0000 


7 


0.1591 


0.0148 


10.805 0.0000 
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FIG. 1: Cross-correlations among the = 48 world financial index returns each of size T = 2744 
(a) The empirical pdf of the coefficients of the cross-correlation matrix C calculated between index 
returns with increasing At quickly converges to the Gaussian form. The normal distribution is the 
distribution of the pairwise cross-correlations for finite length uncorrelated time series, which is a 
normal distribution with mean zero and standard deviation between (b) The empirical pdf 
of the coefficients of the matrix C calculated between index volatilities approaches the pdf of the 
random matrix more slowly than in (a). 
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FIG. 2: Long-range magnitude cross-correlations. The largest singular value \l obtained from the 
spectrum of the matrices C and C versus time lag At. With increasing At, the largest singular values 
obtained for C of returns decays more quickly than C calculated for absolute values of returns. The 
magnitude cross-correlations decay as a power law function with the scaling exponent of k. 0.25. 
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FIG. 3: Short-range cross-correlations of a global factor, (a) Time series of the global factor, (b) 
The auto-correlation function (ACF) of the global factor. The region between dashed lines is the 
95% confidence interval for the no auto-correlation hypothesis. Auto-correlations are smaller than 
0.132 except At = 0, and become insignificant after time lag At = 2, with no more than one 
significant auto-correlation for every 20 time lags. Therefore, only short-range auto-correlations 
can be found in the global factor. 
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FIG. 4: Long-range cross-correlations of the magnitude global factor, (a) Time series of magnitudes 
of the global factor, (b) Auto-correlations of magnitudes of the global factor. The region between 
dashed lines is the 95% confidence interval for the no auto-correlation hypothesis. Auto-correlations 
are much larger than the auto-correlations of the global factor itself, is as large as 0.359 at At = 2, 
and is still larger than 0.2 until At = 33. For every time lag, the autocorrelation is significant 
even after At = 100. Therefore long-range auto-correlations exist in the magnitudes of the global 
factor. 
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FIG. 5: (a) Conditional volatility of the global factor, showing that the clusters in the conditional 
volatilities may serve to predict market crashes. In each cluster, the height indicates the size of 
the market crash, and the width indicates its duration, (b) The 100-day forecasted volatility of the 
global factor, using the past data ranging from 4 Jan 1999 through 10 July 2009. It will converge 
to the unconditional volatility asymptotically. 
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FIG. 6: Cross-correlation between the global factor Mt and each individual index Ri^f, i = 
1,2, ...,48. The global factor has large correlation with most of the indices. However, there are 
indices that are not much correlated with the global factor. 10 of the 48 indices have a correlation 
smaller than 0.1 between the global factor, corresponding to the indices for Iceland, Malta, Nigeria, 
Kenya, Israel, Oman, Qatar, Pakistan, Sri Lanka, and Mongolia. Hence, unlike most countries, the 
economies of these 10 countries are more independent of the world economy. 
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